NP chunking using ILP
نویسنده
چکیده
This is to report the results of approaching the problem of NP chunking using Inductive Logic Programming techniques. The problem, as de-ned in (Ramshaw and Marcus, 1995), is the machine learning of rules that recognise non-recursive, base NPs in text annotated with part-of-speech tags, by tagging each word as beingìnside' oròutside' an NP. (Consecutive NPs are appropriately treated.) The same input data as in the original experiment is used here, but the machine learning technique is Inductive Logic Programming, and more speciically the Progol algorithm. The problem is formulated as the machine learning of a Prolog predicate that will accept a part-of-speech tagged word and its context as input and associate it with the appropriate syntactic tag.
منابع مشابه
BaseNP Chunking using ILP
This is to present work on applying Inductive Logic Programming (ILP) to the task of BaseNP chunking. After ILP and NP Chunking are discussed, the experimental setup for using ILP to construct a BaseNP tagger in Prolog is described. Finally, the results are quantitatively as well as qualitatively analysed.
متن کاملImproving Scalability of Inductive Logic Programming via Pruning and Best-Effort Optimisation
Inductive Logic Programming (ILP) combines rule-based and statistical artificial intelligence methods, by learning a hypothesis comprising a set of rules given background knowledge and constraints for the search space. We focus on extending the XHAIL algorithm for ILP which is based on Answer Set Programming and we evaluate our extensions using the Natural Language Processing application of sen...
متن کاملAn Empirical Study of Vietnamese Noun Phrase Chunking with Discriminative Sequence Models
This paper presents an empirical work for Vietnamese NP chunking task. We show how to build an annotation corpus of NP chunking and how discriminative sequence models are trained using the corpus. Experiment results using 5 fold cross validation test show that discriminative sequence learning are well suitable for Vietnamese chunking. In addition, by empirical experiments we show that the part ...
متن کاملA Supervised Learning based Chunking in Thai using Categorial Grammar
One of the challenging problems in Thai NLP is to manage a problem on a syntactical analysis of a long sentence. This paper applies conditional random field and categorical grammar to develop a chunking method, which can group words into larger unit. Based on the experiment, we found the impressive results. We gain around 74.17% on sentence level chunking. Furthermore we got a more correct pars...
متن کاملNP Alignment in Bilingual Corpora
We created a simple gold standard for English-Hungarian NP-level alignment, Orwell’s 1984, (since this already exists in manually verified POS-tagged format in many languages thanks to the Multex and MultexEast project) by manually verifying the automaticaly generated NP chunking (we used the yamcha, mallet and hunchunk taggers) and manually aligning the maximal NPs and PPs. The maximum NP chun...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999